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(54) Title: METHOD FOR PRODUCING RECOMBINANTS INTENDED FOR USE IN A COMPLETE MALARIA ANTIGENE 
GP190/MSP1 

(54) Bezeichnung: REKOMBINANTES HERSTELLUNGSVERFAHREN FOR EIN VOLLSTANDIGES MALARIA-ANTIGEN 
GP190/MSP1 

(57) Abstract 

The present invention relates to a method for producing recombinants intended for use in the complete cell-surface protein gpl90/MSPl 
from Plasmodium, especially Plasmodium falciparum, as well as the complete DNA sequence of this protein and the appropriate host 
organisms suited for expressing said sequence, whereby the protein concerned can be entirely synthesized outside the parasite. Also, the 
inventive method enables sufficient production of above-mentioned protein and its supply as a vaccine. Finally disclosed is a process for 
stabilizing genes with high At concentration. 

(57) Zusammcnfassung 

Die Erfindung betrifft ein rekombinantes Herstellungsverfahren flirdas komplette gpl90/MSPl-Oberflachenprotein von Plasmodium, 
insbesondere Plasmodium falciparum, sowie die vollstandige DNA-Sequenz dieses Proteins und geeignete Wirtsorganismen far die 
Expression der Sequenz, wodurch das Protein in seiner Gesamtheit erstmals auBerhalb des Parasiten synthetisiert werden konnte. Die 
Erfindung erSffnet erstmals die Mdglichkeit, das gpl90/MSPl-Oberflachenprotein in ausreichender Menge herzusteilen; ferner ist es 
ein Gegenstand der Erfindung gp!90/MSPl als Impfstoff zur Verfugung zu stellen. SchlieBlich gibt die Erfindung ein Verfahren zur 
Stabilisierung von AT-reichen Genen an. 
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Summary 

Recombinant manufacturing process for a complete malaria antigen gp190/MSP1 

The invention concerns a recombinant manufacturing process for the complete gp!90/MSP1 
surface protein of Plasmodium, in particular P. falciparum, as well as the complete DNA 
sequence of this protein and suitable host organisms for expression of the sequence, by 
means of which the protein in its entirety can for the first time be synthesized outside the 
parasite. 

The invention opens up for the first time the possibility of manufacturing the gp190/MSP1 
surface protein in sufficient quantity; in addition there is the matter of making gp190/MSP1 
available as a vaccine. 

Finally the invention sets out a process for the stabilization of AT-rich genes (Fig. 1). 
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Recombinant process for preparing a complete malaria antigen, gp190/MSP1 

The invention concerns a recombinant manufacturing process for the complete malaria 
antigen gp190/MSP1, as well as separate naturally-occurring domains and parts of the same, 
by expression of a synthetic DNA sequence. The invention concerns in addition the DNA 
sequences produced by the process and the host organisms used for the expression of the 
DNA sequences. In addition the invention concerns the use of the complete malaria antigen 
as well as parts thereof as a vaccine for immunization against malaria. 

Finally the invention under consideration concerns a stabilization process 

for AT-rich genes, as well as stabilized genes which are characterized by a reduced AT 

content. 

Malaria is one of the most significant infectious diseases in the world. According to WHO 
reports, in 1990 40% of the world population in 99 countries was exposed to the risk of 
malaria. At the same time its distribution is enormously on the increase. This may be ascribed 
above all to intensive development of resistance in the parasites causing malaria, promoted 
by the recommendation and use as prophylactics of the drugs intended for treatment. Besides 
the search for new and effective chemotherapeutic agents hope is nowadays directed towards 
the development of vaccines, since people in areas of the world where malaria is epidemic do 
manage to develop some kinds of immunity. As well as a natural resistance to malaria, such 
as that found in heterozygous carriers of the sickle-cell gene and people with thalassaemia 
and glucose-6-phosphate dehydrogenase deficiency, in the course of malarial infection in 
humans immune mechanisms can be stimulated which express themselves in a heightened 
capacity for resistance to the Plasmodia. Consequently the course of the disease in 
populations exposed to severe epidemics is generally less threatening than in persons 
exposed to the infection less frequently or for the first time. 

The main problem in the development of a vaccine is the identification of an antigen which can 
induce protective immunity, since there is no easily accessible well-defined animal model 
available for the four parasites affecting man. The organism causing malaria belongs to the 
Plasmodium group, of which infection with one of the four parasites Plasmodium vivax, 
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Plasmodium ovale, Plasmodium malariae or Plasmodium falciparum results from the bite of 
Anopheles mosquitoes. Of these parasites Plasmodium falciparum is the most dangerous and 
the most widely distributed. 

The main surface protein of the merozoite, the invasive form of the blood stage of the malaria 
parasite Plasmodium falciparum and other malaria parasites such as P. vivax, is a 190-220 
kD glycoprotein. Late in the development of the parasite this precursor is processed into 
smaller proteins, which can however be isolated from merozoites as a unitary complex. By 
means of a glycosylphosphatidyl-inositol bond this complex is coupled to the merozoite 
membrane. The sequences of the gp190 proteins of various P. falciparum strains fall into two 
groups, between which intragenic recombination is frequent. In general the protein consists of 
many highly conserved regions, of a dimorphic zone to which in each case one of two alleles 
belongs, and of two relatively small oligomorphic blocs in the N-terminal zone (Tanabe, K., 
Mackay, M. f Goman, M. and Scaife, J.G. (1987), Allelic dimorphism in a surface antigen gehe 
of the malaria parasite Plasmodium falciparum, J. Mol. Biol. 195, 273-287, Miller, L.H., 
Roberts, T., Shahabuddin, M. and McCutchan, T.F. (1993), Analysis of sequence diversity in 
the Plasmodium falciparum merozoite surface protein-1 (MSP-1). Mol. Biochem. Parasitol. 59, 
1-14). 

Already early on gp190/MSP1 was considered as a possible candidate for a vaccine. In the 
rodent model active protection against infection with rodent parasites was obtained following 
immunization with the analogous protein. Passive protection could be procured with antibodies 
directed against this protein (see also Holder, A.A. and Freeman, R.R. (1981), Immunization 
against blood-stage rodent malaria using purified parasite antigens, Nature 294, 361-364; 
Marjarian, W.R., Daly, T.M., Weidanz, W.P. and Long, C.A. (1984), Passive immunization 
against murine malaria with an lgG3 monoclonal antibody, J. Immunol. 132, 3131-3137). The 
data which ought to support this assumption are nevertheless in details not statistically 
significant. 

There are in addition, a number of monoclonal antibodies which in vitro inhibit the invasion of 
erythrocytes by P. falciparum and are directed against gp190/MSP1 (Pirson, P. J. and Perkins, 
M.E. (1985), Characterization with monoclonal antibodies of a surface antigen of Plasmodium 
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falciparum merozoites. J. Immunol. 134, 1946-1951; Blackman, M.J., Heidrich, H.-G., 
Donachie, S., McBride, J.S. and Holder, A.A. (1990), A single fragment of a malaria merozoite 
surface protein remains on the parasite during red-cell invasion and is the target of invasion- 
inhibiting antibodies. J. Exp. Med. 172, 379-382). 

Finally, a series of vaccine studies have been carried out with gp190/MSP1 from P. falciparum 
on primates, particularly on Aotus and Saimiri monkeys (see also Perrin, L.H., Merkli, B., 
Loche, M., Chizzolini, C, Smart, J. and Richie, R. (1984), Antimalarial immunity in Saimiri 
monkeys. Immunization with surface components of asexual blood stages, J. Exp. Med. 160, 
441-451; Hall, R., Hyde, J.E., Goman, M., Simmons, D.L., Hope, I.A., Mackay, M. and Scaife, 
J.G. (1984), Major surface antigen gene of a human malaria parasite cloned and expressed in 
bacteria, Nature 311, 379-382; Siddiqui, W.A., Tarn, L.Q., Kramer, K.J., Hui, G.S.N. , Case, 
S.E., Yamaga, K.M., Chang, S.P., Chan, E.B.T. and Kan, S.-C. (1987), Merozoite surface coat 
precursor protein completely protects Aotus monkeys against Plasmodium falciparum malaria, 
Proc. Natl. Acad. Sci. USA 84, 3014-3018; Ettlinger, H.M., Caspers, P., Materile, H., 
Schoenfeld H.-J., Stueber, D. and Takacs, B. (1991), Ability of recombinant or native proteins 
to protect monkeys against heterologous challenge with Plasmodium falciparum, Inf. Imm. 59, 
3498-3503; Holder, A.A., Freeman, R.R. and Nicholls, S.C. (1988), Immunization against 
Plasmodium falciparum with recombinant polypeptides produced in Escherichia coli, Parasite 
Immunol. 10, 607-617; Herrera, S., Herrera, M.A., Perlaza, B.L., Burki, Y., Caspers, P., 
Doebeli, H., Rotmann D. and Certa, U. (1990), Immunization of Aotus monkeys with 
Plasmodium falciparum blood-stage recombinant proteins, Proc. Natl, Acad. Sci. USA 87, 
4017-4021; Herrera, M.A., Rosero, F., Herrera, S., Caspers, P., Rotmann, D., Sinigaglia, F. 
and Certa, U. (1992), Protection against malaria in Aotus monkeys immunized with a 
recombinant blood-stage antigen fused to a universal T-cell epitope; correlation of serum 
gamma interferon levels with protection, Inf. Imm. 60, 154-158; Patarroyo, M.E., Romero, P., 
Torres, M.L., Clavijo, P., Moreno, A., Martinez A., Rodriquez, R., Guzmann, F. and Cabezas, 
E. (1987), Induction of protective immunity against experimental infection with malaria using 
synthetic peptides, Nature 328, 629-632). In these vaccine studies two premises may be 
distinguished: 

- Use of material isolated from parasites, and 
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- Administration of material procured in heterologous systems of expression. 

The latter consists as a rule of relatively small segments of the total protein. Although the 
results of the inoculations carried out preliminarily on monkeys indicate that gp190/MSP1 
could bring about protection, all the experiments carried out on primates have two problems, 
which place such a conclusion in question: 

(a) they were carried out on too small groups of animals 

(b) they were not repeated. 

The results and the conclusions drawn from them are consequently not statistically confirmed. 
Besides the difficulty of access to suitable monkeys there remains the main basic problem, 
that it has so far not been possible to manufacture good vaccination material in a suitable 
quantity. 

On the other hand, after the sequencing of the gp190 gene from the K1 and MAD20 strains of 
Plasmodium falciparum overlapping fragments could be expressed in E. coli. With this 
material epidemiological studies in West Africa showed that in the adolescent group a 
correlation existed between antibody titre against gp190/MSP1 fragments on one hand and 
protection from parasite infection on the other. In addition the titre also appeared to correlate 
with the capacity to control the parasitaemia even at a low level (Tolle et al. (1993): A 
prospective study of the association between the human humoral immune response to 
Plasmodium falciparum blood stage antigen gp190 and control of malarial infections, Infect. 
Immun. 61 , 40-47). These results are supplemented by new investigations on Aotus monkeys 
in the framework of the present invention. Here an enhanced protection against infection with 
the parasite was attained because protein preparations from Plasmodium falciparum, which 
consisted predominantly of unprocessed gp190/MSP1, had been used as vaccine. The 
monkeys with the highest antibody titres against gp190/MSP1 were the best protected. These 
results eventually indicated gp190 as a most promising candidate for a vaccine against 
tropical malaria. 
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By some groups of workers the C-terminal domain 
of gpl90 (pl9 or p42) is assigned a particular role in the 
immunity mediated by gpl90 (see also Chang, S.P., Case, 
S.e., Gosnell, W.L., Hashimoto, A., Kramer, K.J., Tarn, 
L.Q., Hashiro, C.Q., Nikaido, CM., Gibson, H.L., Lee-Ng, 
C.T., Barr, P.J., Yokota, B.T. and Hui, G.S.N. (1996), A 
recombinant baculovirus 42-kilodalton C-terminal fragment 
of Plasmodium falciparum merozoite surface protein 1 
protects Aotus monkeys against malaria, Inf. Imm. 64, 253- 
261; Burghaus, P. A. , Wellde, B.T. / Hall, T . , Richards, 
R.L., Egan, A.F., Riley, E.M., Ripley-Ballou, W. and Holder 
A. A. (1996), Immunization of Aotus nancymai with 
recombinant C-terminus of Plasmodiu, falciparum merozoite 
surface protein 1 in lipsomes and alum adjuvant does not 
induce protection against a challenge infection, Inf. Imm., 
in press . 

Thus far, however, it has also been impossible to 
exclude other parts of gpl90 on a rational basis as 
irrelevant to a protective immune response. Hence it is as 
necessary as ever to use the entire gene or the intact 
gpl90 for vaccine investigations. Despite multiple 
investigations by various work-groups, however, there has 
not yet been any success in cloning and expressing the 
entire gpl9 0/MSPl gene. 

Nor has it so far been possible to exclude a 
priori any part of the gpl9 0 sequence as irrelevant to the 
protective immune response, so that it is as necessary as 
ever to use the entire gene or gene product for vaccine 
investigations. Nevertheless, despite many investigations 
by a number of working groups there has not yet been any 
successful cloning of the whole gene for gpl90/MSPl. 

The present invention provides a means of making 
available an adequate quantity of vaccine material in the 
form of the complete gpl90/MSPl. The present invention 
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further provides a process by which this vaccine material 
could be recovered. 

In addition the present invention provides a 
complete DNA sequence of gpl9 0/MSPl which could be 
expressed in a host organism. 

The present invention also provides a host 
organism containing the complete gpl90/MSPl gene. 

Finally, the present invention provides a 
stabilisation process for AT-rich genes, as well as a 
stabilised gene suitable for expression characterised in a 
reduction of the AT content. 

In the following, certain concepts are explained 
in more detail in order to make clear how they should be 
understood in this context. 



2 0 Throughout the description and claims of this 

specification, the word "comprise" and variations of the 
word, such as "comprising" and "comprises", means 
"including but not limited to", and is not intended to 
exclude other additives, components, integers or steps". 



15 
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"Recombinant manufacturing process" means that a 
protein of a DNA sequence is expressed by a suitable host 
organism in which the DNA sequence has arisen from cloning 
and fusion of individual DNA fragments. 



"Complete gpl9 0/MSPl protein" here means the 
entire gpl90/MSPl surface protein isolatable from the above 
named Plamodia, especially Plamodium falciparum, 
representing the main surface protein. of the above named 
3 5 parasite as well as the proteins with analogous function 
from the Plasmodium species such as p. vivax. The term 
therefore comprises in each case the main surface protein 
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of the merozoites of the four malaria parasites named above 
as dangerous to man. "Complete gpl90/MSPl gene" means the 
gene coding for this protein. In this context "complete" 
signifies that the entire amino-acid sequence of the native 
protein is present or that the gene sequence codes for the 
entire amino-acid sequence of the native protein. Mutated 
and/ or shortened forms of gpl9 0/MSPl are however included 
therewith insofar a they display the same immunisation 
potential (vaccine protection) as the complete gpl90/MSPl. 
Finally the term also includes variants of gpl90/MSPl 
characterised by containing in one molecule protein 
fragments of various alleles. 

"FCB-1" is a strain of P. falciparum such as that 
described in Heidrich, H.-G., Miettinen-Baumann, A., 
Eckerskorn, C. and Lottspeich, F. (1989) The N-terminal 
amino acid sequences of the Plasmodium falciparum (FCBl) 
merozoite surface antigens of 42 and 36 kilodalton, both 
derived from the 185-195-kilodalton precursor. Mol . 
Biochem. Parasitol. 34, 147-154. 

''Attachment signal" here means a protein 
structure for by a DNA sequence at the 3' or 5' end of the 
gene according to the invention. Attachment signals are 
structures enabling the attachment of a polypeptide to 
other structures, such as for example membranes. 

"Signal peptide" here signifies a protein 
structure coded for which a DNA sequence at the N-terminal 
end of the gene according to the invention codes . Signal 
peptides are structures which among other things enable 
penetration of the polypeptide into membranes. 

In the context of the present invention "AT- 
content" means the percentage amount of adenine -thymine 
base pairs compared to guanine-cytosine base pairs. 
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"Cloning" will comprehend here all known state- 
of-the-art cloning methods which could be applied here, but 
which are nevertheless not all described in detail because 
they belong among the normal tools of the person skilled in 
the art . 

"Expression in an appropriate expression system" 
should here include all known state-of-the-art methods of 
expression in known expression systems which could be 
applied here, but which are nevertheless not all described 
in detail because they belong among the normal tools of the 
person skilled in the art. 

The present invention provides a process by which 
the protein pgl90/MSPl and its gene can be produced in 
sufficient quantity without excessive cost. 

For the first time it is possible by this process 
to synthesise the protein in its entirety outside the 
parasite. As the analysis with conformational epitope- 
recognising monoclonal antibodies shows, the protein thus 
synthesised is at least reproducibly synthesisable over 
wide areas in naturally folded form. By the recombinant 
manufacturing process many milligrams of intact gpl90/MSPl 
could in every case be recovered from the host organism, a 
quantity which for 
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technical and economic reasons can never be recovered from parasites. Production of the 
protein in any desired quantity is now possible and opens new perspectives for its use as an 
experimental vaccine against malaria. Furthermore, the way is now open for the development 
of living vaccines as well as for vaccines based on nucleic acids. 

Synthesis of the gene sequence coding for the protein gp190/MSP1 is preferentially based on 
the sequence of the FCB-1 strain of P. falciparum. P. falciparum is the agent of tropical, 
malaria and hence of the most dangerous among the types of malaria. The basic gene is a 
representative of the "K1 allele", where K1 stands for a particular P. falciparum strain. Its 
coding sequence extends over 4917 base pairs and includes a signal sequence at the N- 
terminal end as well as an attachment sequence at the C-terminal end. 

Furthermore, according to the invention the recombinant manufacturing process is 
preferentially characterized in having the AT content of the QNA sequence on which the 
protein is based reduced relative to the wild type, from 74% in the original gene preferably to 
about 55%, for example while the amino-acid sequence of the FCB-1 protein is maintained a 
DNA sequence with the codon frequencies usual in the human genome is produced. Other 
codon frequencies which reduce the AT content are also conceivable. 

Preferentially the gene underlying the protein produced by the recombinant manufacturing 
process codes for the full amino-acid sequence including signal peptide and GPI attachment 
signal peptide, further described as gp190 s . 

In another preferred embodiment, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
except for the GPI attachment signal. This embodiment is then described as gp190 s1 . 

In yet another preferred embodiment, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
except for the GPI attachment signal and the signal peptide. This embodiment is then 
described as gp190 S2 . 
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In a further preferred embodiment type, the gene on which the protein produced by the 
recombinant manufacturing process is based codes for the complete amino-acid sequence 
and a trans-membrane attachment sequence. 

In a particularly preferred embodiment the recombinant manufacturing process includes the 
following steps: 

In the first place the design of the DNA sequence to be synthesized on the basis of the gene 
from P. falciparum FCB-1 , in which a DNA sequence with for example the codon frequencies 
common in the human genome is manufactured with retention of the amino-acid sequence of 
the FCB-1 protein. 

The AT content of the gene should be reduced by this, preferably to 55%. Further on in the 
process the planned sequence is divided for example into 5 overlapping regions, which at the 
same time correspond to domains of the natural processing products of gp190/MSP1 from 
FCB-1: p83, p31, p36, p30 and p19. 

Desoxyoligonucleotides are synthesized, which in each case extend the entire length of a 
region. 

The desoxyoligonucleotides so synthesized are particularly preferred where their sequence 
corresponds in an alternating manner to the "upper" (5' - 3') or the "lower" (3' - 5') DNA 
strand. The length of these oligonucleotides is preferably on average 120 nucleotides and 
they overlap the neighboring sequences in each case by about 20 bases. 

In one possible embodiment DNA sequences of about double the length of the existing end- 
products are manufactured by asymmetrical PCR, in effect so that the superfluous DNA 
sequences nearby in each case represent the opposite strand. This leads in a second 

PCR amplification cycle to a second product corresponding to the length of four originally 
inserted oligonucleotides excluding the overlapping region. Transfer of these products to a 
preparation consisting predominantly of single-stranded DNA by asymmetrical PCR with the 



terminal oligonucleotides permits the manufacture in a 
further amplification step of an 800-bp long double- 
stranded DNA fragment in only 25 PGR cycles. 

In this manner the regions coding for pl9, p3 0, 
p36 and p31 are directly synthesised and molecularly cloned 
in E. coli. Clones with fault-free sequences are conserved 
either directly or by the joining up of fault-free sequence 
fragments. The region which codes for p83 is constructed 
by fusion from two sequences comprising about 1200 bp. 

In the further course of production single 
sequences are cloned. As expression vectors candidates 
preferred are the plasmids pDS56, RBS11 ("Hochuli, E., 
Bannwarth, W. , Doebeli, H., Gentz, R. and Stueber, D. 
(1988) Genetic approach to facilitate purification of 
recombinant proteins with a novel metal chelate adsorbent. 
Biqtechn. 6, 1321-1325"), pBi-5 (« Baron, U. , Freundlib, S., 
Gossen, M. and Bujard, H. (19 95) Corregulation of two gene 
activities by tetracycline via a bidirectional promoter. 
Nucl. Acids Res. 23, 3605-3606") and ppTMCS . It is 
possible nonetheless also to conceive of other expression 
vectors . 

Host organisms preferred for expression are E. 
coli, with the strain DH5alphaZl specially preferred (R. 
Rutz, Dissertation 1996, Heidelberg University), HeLa 
cells, CHO cells, Toxoplasma gondii ( Pf ef f erkorn, E.R. and 
Pfefferkorn, C.C. 197 6, Toxoplasm gondii : Isolation and 
preliminary characterisation of temperature-sensitive 
mutants. Exp. Parasitol. 39, 365-376) or Leishmania. 
Additional host systems might be e.g. yeasts, baculoviruses 
or adenoviruses, so that the subject matter of the 
invention should not be limited to the host systems 
mentioned. 

The present invention also provides a complete 
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DNA sequence, suitable for expression, of the gpl9 0/MSPl 
surface protein of P. falciparum. 

In a preferred embodiment of the present 
invention the sequence suitable for expression codes for 
the complete amino-acid sequence. 



\\melb_£iles\home$\Bkrot\Keep\speci\48649-97 .doc 21/06/00 



11 



In another preferred embodiment of the present invention the sequence suitable for 
expression codes for the complete amino-acid sequence except for the attachment signal. 

In a further preferred embodiment according to the present invention the DNA sequence 
suitable for expression codes for the complete amino-acid sequence except the attachment 
signal and the peptide signal. This embodiment of gp!90/MSP1 can hence be characterized in 
including at the N-terminus 1 1 additional amino-acids, of which 6 are histidines. 

Particularly preferred the DNA sequence suitable for expression contains no recognizable 
"splice-donor" and "splice-acceptor" sites, and is preferably characterized in not containing 
any larger GC-rich sequences which might result in stable hairpin structures at the RNA level. 

Recognition signals for restriction enzymes which recognize sequences of six or more base 
pairs should preferably be avoided. 

In a preferred embodiment specific cleavage sites for restriction endonucleases, occurring 
only once in the gene, are introduced into regions to separate the existing domains following 
processing of the protein. 

Particularly preferred would be the presence at both ends of the gene of sequences for 
restriction endonucleases which do not occur in the gene. 

Furthermore host organisms containing the complete sequence of gp190/MSP1 surface 
protein are provided by the invention. 

Such host organisms are preferably E. coli, particularly preferred being the strain 
DH5alphaZ1, HeLa cells, CHO cells, Toxoplasma gondii or Leishmania. The HeLa and CHO 
cells ought preferably to synthesize constitutively tTA. 
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Finally the present invention provides a possibility of using a gp190/MSP1 surface protein 
created produced according to the recombinant manufacturing process, or parts thereof, for 
active immunization against malaria. 

The scheme for synthesis presented here also permits manufacture of the second allele of the 
gp190/MSP1 gene, whereby the dimorphism of the protein is also taken into account. The 
main variability of the protein depends however on the sequences of two relatively short blocs, 
blocks II and IV (ref. 1), which are oligomorphic. The present sequence data make it possible 
to disclose over 95% of all known gp190/MSP1 sequences with 6-8 sequence combinations of 
these blocs. The synthesis of these sequence variants can be brought about problem-free by 
means of the strategies proposed here, so that variants can be built up both in the K1 and in 
the MAD20 allele. Vaccines from the families of sequences thus created can confer protection 
where required against a wide spectrum of parasites with gp190/MSP1 variants. 

The manufacture of different types of vaccine is possible: 

- At the level of protein preparations, where in each instance mixtures of the two families (K1 
type, MAD20 type with different variants of Blocs II and IV) can come into use. Various carrier 
or adjuvant materials could be added: aluminum oxide, liposomes, IscomsQSzl , etc. 

- At the level of live vaccines: (a) viral carriers, especially vaccinia and adenoviruses; (b) 
parasites as carriers, particularly avirulent forms of Leishmania and Toxoplasma; (c) bacterial 
carriers, e.g. Salmonella. 

- At the level of nucleic acids, whereby for example vectors suitable for gene therapy would 
be used to introduce the gene into the host; beyond that the introduction of nucleic acids 
coding for the desired protein can be envisaged. 

A further possibility for vaccination lies in the use of a gp190/MSP1 protein produced 
according to the recombinant manufacturing process set out by the invention, for the 
production of monoclonal antibodies which can then be used in their turn for passive 
immunization against malaria. 
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Similarly it becomes possible to use the DNA sequence on which the protein is based at an 
intermediate stage arising in the course of the recombinant manufacturing process for the 
construction of a vaccine based on nucleic acids. 

Finally the invention also concerns a process for the stabilization of gene sequences, 
especially for sequences which do not show adequate stability in expression systems. 

According to the invention this stabilization is attained because the AT content of the 
sequence is reduced. 

Moreover a stabilized gene characterized by having a reduced AT content is provided by the 
invention. An example of such a stabilized gene is the gene for gp190/MSP1 surface protein 
according to the present invention. 

In the following the invention will be described with the help of figures and tables as well as 
some examples in individual embodiments. 

They show: 

Fig. 1: Schematic representation of the gp190/MSP1 precursor 
protein from P. falciparum (FCB-1). 

Fig. 2: Two vaccine trials carried out on Aotus monkeys with 
native gp190/MSP1 from P. falciparum (FCB-1). 

Fig. 2A: With 3 x 60 micrograms gp190/MSP1 

Fig. 2B: With 3 x 40 micrograms gp190/MSP1 

Fig. 3A: Strategy of synthesis of the gp190/MSP1 gene 
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Fig. 3B: Principle of PCR-basecl total synthesis 

Fig. 3C: Total sequencing of gp190 s 

Fig. 3D: N- and C-termini of gp190 s1 variant 

Fig. 4A: Expression vector pDS56 with gp190 s2 sequence 

Fig. 4B: Gel electrophoresis of gp190 S2 

Fig. 5A: Expression vector pBi-5 with gp190 s1 sequence 

Fig. 5B: Immunofluorescence of HeLa cells 

Fig. 5C: Electrophoretic characterization of gp190 s1 
purified from HeLa cells 



Fig. 6A: Expression vector ppT 190 with gp190 sequence 

Fig. 6B: Immunofluorescence of the expression of gp190 s 
in T. gondii 

Fig. 6C: Polyacrylamide gel electrophoresis of gp190 from 
T. gondii 

In the gp190/MSP1 precursor protein from P. falciparum schematically represented in Fig. 1 
the dark blocs stand for regions which are strongly conserved in all strains. The cross-hatched 
blocs indicate the dimorphic areas, which in the case of the FCB-1 isolate derive from the K1 
allele. 01 and 02 indicate the oligomorphic areas. S denotes the peptide signal sequence 
containing 19 amino-acids, GA the C-terminal region, which includes the signal for the GPI 
attachment of the protein to the membrane. The arrows indicate the sites of the processing by 
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which the proteins p53, p31, p36, p30 and p19 arise. The gp190 gene codes for altogether 
1639 amino-acids. 

The other figures are more conveniently explained in the context of the following Examples. 
EXAMPLES 

Example 1: total synthesis of one of the DNA sequences coding for qp190/MSP1 (see Fig. 3) 

A. Strategy of synthesis of the qp190/MSP1 gene (gp190 s ) (see Fig. 3A). 

The sequence was divided into fragments corresponding to the main processing products: 
p83, p31, p36 t p30 and p19. In the transition regions cleavage sites for restriction 
endonucleases (arrows in fig. 3) were inserted in such a way that the amino-acid sequence 
was not altered. All the particular cleavage sites are found only once in the sequence. 

The fragments were synthesized to overlap, so that the cleavage sites at the respective ends 
made attachment by ligation to the neighboring fragment possible. All individual fragments 
contain in addition at their 5' ends a BamHI division site for insertion into expression vectors. 
The entire sequence could be cloned via Mlul and Clal. The scheme indicated here leads in 
addition to a sequence which cannot produce the GPI attachment since the C-terminal lacks 
18 amino-acids. Synthesis of a corresponding oligonucleotide as well as of a "primer" 
extending over the Sphl cleavage site, leads after PCR to the GA fragment which could be 
used by Sphl and Clal, the resulting total sequence being gp190 s . On removing the sequence 
coding for the peptide signal, "PCR Primer" is produced, over which the fragment AS has 
been synthesized. It is permissible to alter the N-terminal via a BamHI and a Hindlll cleavage 
site in such a way that the protein begins with amino-acid no. 20. The nuclear sequence which 
encodes gp190/MSP1 without signal sequence and without GPI attachment signal was 
designated gp190S2. Deletion of the GPI attachment signal alone leads to gp190 s1 . 
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B. Principle of the PCR-supported total synthesis (see Fig. 3B) 

Oiigoclesoxynucleotides of about 120 nucleotides have been synthesized in an alternating 
manner from the coding or the non-coding strand in such a way that in each case about 20 
bases overlapped with the neighboring fragment The scheme illustrates for example the 
synthesis of a fragment about 800bp long from oligonucleotides. At the first stage 2 
oligonucleotides were amplified "asymmetrically" in each of 4 reaction vessels. This resulted in 
4 populations of DNA about 220bp in length, consisting predominantly of single strands (A, B, 

C, D). Uniting A to B and C to D with amplification over 5 cycles led to 2 approximately 400bp 
long double-stranded products. Asymmetrical amplification of these DNA fragments (Stage III) 
resulted in single-stranded populations which following uniting and amplification (Stage IV) 
resulted after 10 cycles in the end-product G of about 800bp in length. This synthesis could be 
carried out without isolation of intermediate products and without renewing buffer or enzyme, 
and was completed in 3 hours. The end-product was purified electrophoretically, divided up 
with the appropriate restriction endonucleases, and cloned in E. coli in pBluescript 
(Stratagene), to which polylinker a Mlul and a Clal cleavage site had been added. 

C. Total sequence of gp190 s (see Fig. 3C) 

Following fusion of all part sequences (Fig. 3A) in pBluescript, the sequence of the gene was 
checked by the di-deoxy method. The reading frame of gp190 s had a length of 491 7bp (+ 2 
stop-codons) and encoded an amino-acid sequence corresponding to that of the gp190/MSP1 
from FCB-1 (1639 amino-acids). 

D. N- and C-termini of the qp190 s1 variant (see Fig. 3D) 

The N-terminal sequence, beginning with the BamHI cleavage site, indicates the transition at 
amino-acid 20, from which it can be assumed that after splitting of the signal peptide it defines 
the N-terminus. At the C-terminus the sequence encoded ended at amino-acid 1621. The 
stop-codon followed the Clal cleavage site. 
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Example 2: Expression of qp190 S2 in E. coli 

A. Expression vector (see Fig. 4A) 

The gp190 sz sequence was inserted via the BamHI and Cial cleavage sites into pDS56RBSII, 
by means of which 6 histidines as well as some amino-acids originating in the vector were 
fused to the N-terminus. This produces the following N-terminal sequence on the reading- 
frame: Met Arg Gly Ser (His) 6 Gly Ser. Through the promoter P N 2siaco-i the transcription comes 
under lacR/O/IPTG control. 

B. Expression and purification of qp190 S2 (see Fig. 4D) 

Carrying over the vector pDS56RBSIIgp1 90 s2 into E. coli DH5alphaZ1 and induction of 
synthesis through IPTG resulted after electrophoretic separation of the total protein extract 
from the culture in a clearly visible band of the anticipated size (arrow). Purification of the 
material through I MAC and affinity chromatography (antibody column with mAK5.2) led to a 
homogeneous product of about 190 kD. In the Figure M stands for molecular weight 
standards; 1 = E. coli before; 2 = after induction with IPTG for 2 hours; 3, 4, 5 = fractions from 
elution of the mAK column. 

Example. 3: Tetracvcline-controlled expression of qp190 s1 in HeLa and CHO cells and 
isolation of the product (see also Fig. 5 and 6c) 

A. The gp190 sequence was inserted via the BamHI/Clal cleavage sites into the expression 
vector pBi-5. In this way transcription of the gene came under the control of a bidirectional 
"tTA-reponsive" promoter and could be regulated through Tc. The bidirectional promoter 
simultaneously initiated transcription of the indicator gene luciferase. In consequence the 
regulation of the expression could easily be followed (see also Fig. 5A). 
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B. Immunofluorescence of HeLa cells, which express luciferase and gp190 s1 under Tc control 

The production of luciferase (left), gp190 s1 (middle) in the absence of Tc was demonstrated in 
HtTA93-9 cells, which contain the bidirectional transcription unit of (A). Following addition of Tc 
no noteworthy synthesis of gp190S1 was shown (as represented in Fig. SB.right). 

C. Electrophoretic characterization of qp190 s1 purified from HeLa cells 

The HeLa cell clone HtTA93-9 as well as the CHO cell clone CH027-29 have been cultivated 
with or without Tc. Cell extracts separated by electrophoresis have been analyzed with 
mAK5.2 by means of "Western blotting" (Fig. 5C); analysis of the CHO cell line is shown on 
the left, of HeLa on the right. (1) = culture without, (2) = culture with Tc, (3) = non-transfected 
HtTA-1 cell line. Molecular weight standards are in each case indicated on the left. 

D. Purification of gp190 s1 svnthesised by HeLa cell clone HtTA93-9 

Preparative cultivation of the HtTA line and induction of expression of gp190 s1 by withholding 
Tc permitted isolation of the gene product by affinity chromatography (mAK5.2 column). 

The polyacrylamide gel stained with Coomassie (Fig. 6C) following electrophoresis displayed a 
product consisting of gp190 s1 as well as another protein of about 50 kD. The latter was not 
derived from gp190 s1 and thus originated from the HeLa cells. Its projected removal should 
nevertheless present no difficulty in principle. 

Example 4: Expression of gp190 s1 in Toxoplasma gondii and purification of the product (see 
also Fig. 6). 

A. The gp190 s sequence was inserted into the vector ppT via Mlul/Pstl. This brought the gene 
under the control of the tubulin promoter (P tU b-i) of T. gondii. The 3* untranslated region (VTR) 
originated from the main surface protein of T. gondii (SAG-1). 
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B. Expression of qp190 s in T. gondii 

Transfection of T. gondii with pTT190 led to the isolation of parasite lines which expressed 
constitutively gp190 s Immunofluorescence with mAK5.2 (middle picture) showed not only 
expression of the gene but also situated the binding of the expression product close to the 
surface of the parasite, since it, like SAG-1 , provokes the same pattern of 
immunofluorescence (right section of fig. 6B). On the left in Fig. 6B a phase contrast 
photograph of the middle picture is shown. 

C. Isolation of gp190S from T. gondii. 

By means of affinity chromatography (mAK5.2 column) gp190 s was purified from a prepared 
quantity of T. gondii (5 x 10 9 parasites). The extremely pure protein possessed the anticipated 
molecular weight, as the Coomassie-stained polyacrylamide gel indicated following 
electrophoresis (2-3 on Figure 6C). At no. (1) on Fig. 6C purified gp190 s1 from CHO cells is 
represented with molecular weight marked on the left side. 

Example 5: Characterization of gp190 s with monoclonal antibodies. 

The interaction of 16 monoclonal antibodies with gp1 90 s from the various heterologous 
expression systems was reviewed by immunofluorescence on P. falciparum and T. gondii or 
by "Western blot" on the purified proteins. Complete agreement was found when the two 
parasites were compared (number of +s indicates the relative intensity of the fluorescence). 
On Western blotting 12mAK's reacted with gp190 s from E. coli and T. gondii. On the other 
hand 3 antibodies did not bind to material isolated from CHO cells. Antibodies 15 and 16, 
which recognize epitopes from the oligomorphs or the alternative allele (MAD20), did not 
react. The results are summarized in Table 1 , in which ND means "not carried, out". 
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Example. 6: expression of gp190 s in heterologous systems 

1. Expression in E. coli 

The gp190 s2 was inserted into the expression vector pDS56, RBSII, where it came under 
control of the promoter PN 2 5iaco-i, which can be controlled via the lac operator/repressor/IPTG 
system (Fig. 4A). Transfer of the plasmid into repressor-producing E. coli cells, eg E. coli 
DH5alphaZ1, permitted expression of pg190 S2 under IPTG control. By means of a nickel- 
chelate column the raw product could be isolated via the N-terminal (His) 6 sequence 
introduced by the vector. An ensuing affinity chromatography on an antibody column led to an 
extremely pure preparation. Since the monoclonal antibodies used (mAK5.2) recognized a 
conformational epitope in the C-terminal region, this 2-step purification selected a full-length 
intact protein with correct folding at least at the C-terminus (Fig. 4B). 

In contradistinction to the natural material the end-product possesses 1 1 additional amino- 
acids at the N-terminus, of which 6 are histidines. It contains no N-terminal signal and also no 
C-termtnal attachment sequence. The P. falciparum-specific sequence begins with amino-acid 
20 and ends with amino-acid 1621 . 

2. Controlled expression of gp190 s1 in HeLa and CHO cell cultures 

The gp190S1 was inserted into the vector pBi-5 and thereby placed under control of a 
promoter regulable by tetracycline (Tc). The Tc-contolled system was chosen for 2 reasons: 

- It belongs to the expression systems with which the highest yield is obtained in mammalian 
cells. 

- Unsecreted foreign proteins at high concentration can interfere negatively with cell 
metabolism. Synthesis of the desired product is consequently begun only after maturation of 
the culture. 
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In the construct pBi5-gp190 s1 a bidirectional promoter was activated by the Tc-controlled 
transcription activator and initiated transcription of both gp190 s1 and the luciferase indicator 
gene. In the presence of Tc the promoter is inactive. The transcription unit was transferred into 
both HeLa and CHO cells, which both synthesize constitutively tTA (HtTA line: Gossen, M. and 
Bujard, H. (1992), Tight control of gene expression in mammalian cells by tetracycline- 
responsive promoters. Proc. Natl. Acad. Sci. USA 89, 5547-5551; CHO-tTA line, 
unpublished). Through cotransfection (Ca 2+ -phosphate method) with a hygromycin- 
resistance-inducing marker gene was selected for successful chromosomal integration. 
Hygromycin-resistant clones were then investigated for regulabiiity of the expression >Tc, in 
which luciferase activity was used as indicator. The gp190 synthesis was tested in well 
regulable clones (regulation factor *Tc 1000). Immunofluorescence analysis (Fig. 5B) as well 
as investigation by "Western blot" (Fig. 5C) allowed the identification in both cell types of 
clones which synthesized gp190 under strictly regulable conditions. The best regulable of 20 
clones were in each case subcloned. The subclones HtTA93-9 and CH027-29 were used for 
cultures on a scale of 10:1. From cell extracts of these cultures intact gp190 s1 could be 
isolated by means of affinity chromatography (mAK5.2). The material was homogeneous 
except for a single cellular component which did not derive from gp190 s1 and made up 25% of 
the preparation (Fig. 6C). It had to be removed in a further purification step. 

3. Expression of gp190S in Toxoplasma gondii. 

Like P. falciparum, Toxoplasma gondii belongs to the Apicomplexa and consequently has a 
protein modification system apparently similar to that of P. falciparum. T. gondii can be 
transfected with foreign DNA which is efficiently integrated into the genome and furthermore 
allows problem-free multiplication of T. gondii in cell culture. To obtain a product most like 
native gp190 t gp190S2 is expressed in such a way that the protein is secreted on the surface 
of the parasite and, as in P. falciparum, bound to the membrane via a GPI analogue. In that 
way the gp190S2 (Fig. 3A) has been inserted (Fig. 6A) into the plasmid ppTMCS (D. Soldati, 
unpublished) and thereby placed under the control of the T. gondii tubulin promoter, 

This expression construct was transfected into T. gondii. Selection with chloramphenicol led to 
resistant clones synthesizing gp190 which were detected by immunofluorescence (Fig. 6B). 
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The immunofluorescence with anti-gp190 antibodies was indistinguishable from a 
corresponding pigmentation of the parasites by means of antibodies against SAG1, the main 
surface protein of T. gondii. It may be deduced from this that gp190 is bound to the surface of 
T. gondii. Several T. gondii clones (Nos. 3.1 to 3.4) were characterized and saved for the 
production of gp190. Using affinity chromatography (mAK5.2) gp190 was isolated from T. 
gondii cultures (clone 3.4) cultivated on a preparative scale. Electrophoretic analysis revealed 
a homogeneous product with a migration rate similar to that of the intact protein (Fig. 6C) 

Example 7: Characterization of qp190 protein from various expression systems by means of 
monoclonal antibodies. 

A set of gp190-specific monoclonal antibodies, of which a number recognize conformational 
epitopes, were used to compare the reactivity of the antibodies with P. falciparum and T. 
gondii parasites via immunofluorescence. Table T shows that the reactivity of the 16 
antibodies with either parasite is the same. This is a strong indication that in T. gondii "native" 
gp190 is being mostly produced. Comparison of the reactivity of the antibodies with protein 
from E.coli, HeLa or CHO cells as well as T gondii shows also that most of the antibodies react 
with the 4 preparations. In particular the protein derived from E. coli recognizes more of the 
antibodies than that produced in mammalian cells. This is apparently a consequence of 
glycosylation in mammalian cells. 

Example 8: Immunization of Aotus lemurinus griseimembra monkeys with gp190/MSP from P. 
falciparum (FCB-1). 

Two independent immunization experiments (A, B) were carried out. In them in one instance 
(A) 1 .0 mg and in the other (B) 0.6 mg of very pure gp190/MSP1 was extracted from about 2 x 
10 11 parasites respectively. 

The protein was administered together with Freund's Adjuvant (FCA). The control group 
received only FCA. Immunization equally with the protein mixture or the adjuvant was done 
three times at intervals of 4 weeks. Two weeks after the last immunization each of the animals 
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was infected with 10 5 parasites (FVO strain) from a donor animal. Parasitaemia was measured 
daily. The results are summarized in Fig. 2. The symbols mean: 

T: that the animals were treated with resochin 

D: a dead animal 

Fig. 2A: individuals in the vaccinated group each received 
3 x 60 micrograms gp190/MSP1 

Fig. 2B: individuals in the vaccinated group each received 
3 x 40 micrograms gp190/MSPl 

While in the control group only 1/1 1 animals did not develop parasitaemia, this was 6/10 in the 
vaccinated group. The four animals in the vaccinated group who did develop a pronounced 
parasitaemia did so - in comparison to the control group - with an average delay of four days 
(exceeding the 2% limit of parasitaemia). 

These experiments indicate for the first time a highly significant protection by gp190/MSP1 
against infection with P. falciparum in a monkey model. The process according to the invention 
consequently permits a practical vaccine against malaria to be presented for the first time. 
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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 

1. A process for preparing the complete gpl9 0/MSPl 

protein of Plasmodium comprising the step of expressing a 
5 gene for gpl90/MSPl protein, wherein in the DNA sequence of 
the gene has a reduced AT content in comparison to the 
naturally occurring sequence. 



2. A process according to claim 1, wherein the 
10 Plasmodium is Plasmodium falciparum . 

3. A process according to claim 1 or claim 2, 
wherein the expression takes place within a host organism. 

15 4. A process according to any one of claims 1 to 3, 

wherein the DNA sequence is derived from the DNA sequence 
of Plasmodium falciparum strain FCB-1. 

5. A process according to any one of claims 1 to 4 , 
2 0 wherein the AT content is reduced from 74% to 55%. 

6. A process according to any one of claims 1 to 5, 
wherein the DNA sequence encodes the complete amino acid 
sequence including signal peptide and attachment signal. 

25 

7. A process according to any one of claims 1 to 5, 
wherein the DNA sequence encodes the complete amino acid 
sequence except for the attachment signal. 

30 8. A process according to any one of claims 1 to 5, 

wherein the DNA sequence encodes the complete amino acid 
sequence except for the attachment signal and the signal 
peptide . 

35 9. A process according to any one of claims 1 to 8 , 

wherein the process comprises the following steps: 

(a) Design of a DNA sequence based upon the DNA 
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sequence from Plasmodium falciparum FCB-1, wherein the DNA 
sequence takes in to consideration the degeneracy of the 
genetic code but still maintains the amino acid sequence; 

(b) Division of sequence into overlapping regions 
5 wherein regions consist of p83, p31ql, p36, gp30 and gpl9; 

(c) Synthesis of desoxyoligonucleotides, wherein 
each of these extend the whole length of a region ; 

(d) synthesis of the regions coding for gpl9, 

p3 0, p3 6, p31 by PCR and synthesis of the region coding for 
10 p83 by fusion of two sequences comprising approximately 
1200bp; 

(e) individual cloning of coding sequences; 

(f) fusion of the complete gene; and 

(g) expression in a suitable expression system. 

15 

10. A recombinant process according to claim 9, 

wherein the desoxyoligonucleotides synthesized in step (c) 
should be on average 120 nucleotides long and in each 
instance overlap the neighbouring sequences by about 2 0 
2 0 bases. 
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11. A recombinant process according to any one of 

claims 1 to 8, wherein the expression vector is 
dPS56,RBSII . 

12 . A recombinant process according to any one of 

claims 1 to 8, wherein the expression vector is pBi-5. 



13 . A recombinant process according to any one of 
3 0 claims 1 to 8 , wherein the expression vector is ppTMCS . 

14. A recombinant process according to any one of 
claims 1 to 13, wherein the DNA sequence is expressed in E. 
coli . 
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15. A recombinant process according to claim 14, 

wherein the E. coli strain used is the repressor-producing 
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strain E . coli DH5alphaZl. 

16. A recombinant process according to any one of 

claims 1 to 13 , wherein the DNA sequence is expressed in 
HeLa cells. 

17 . A recombinant process according to any one of 

claims 1 to 13 , wherein the DNA sequence is expressed in 
CHO cells. 

18. A recombinant process according to any one of 
claims 1 to 13 , wherein the DNA sequence is expressed in 
Toxoplasma gondii or Leishmania. 

19. A complete DNA sequence which encodes the 
gpl90/MSPl surface protein of Plasmodium, wherein the DNA 
has a reduced AT content compared to the natural ly- 
ocqurring DNA sequence. 

20. A complete DNA sequence according to claim 19, 
wherein the Plasmodium is Plasmodium falciparum. 

21. A DNA sequence according to claim 19 or claim 20, 
wherein the DNA sequence is derived from the DNA sequence 
of Plasmodium falciparum strain FCB-1. 

22. A DNA sequence according to any one of claims 19 
to 21, wherein the AT content is reduced from 74% to 55%. 

23. A DNA sequence according to any one of claims 19 
to 22, wherein the DNA does not code for the attachment 
signal . 

24. A DNA sequence according to claim 23, wherein the 
DNA does not code for the signal peptide. 

25. A DNA sequence according to claim 24, wherein the 
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DNA includes a sequence which encodes 11 additional amino- 
acids, of which 6 are histidines, present at the N- 
terminus . 

5 26. A DNA sequence according to any one of claims 19 

to 25, wherein the sequence includes no recognizable 
"splice donor" and "splice acceptor" signals. 

27. A DNA sequence according to any one of claims 19 

10 to 26, wherein the sequence includes no large GC-rich 
sequences . 
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28. A DNA sequence according to any one of claims 19 

to 27, wherein the sequence includes no recognition signals 
for restriction enzymes recognizing sequences of six or 
more base pairs . 



2 9^ A DNA sequence according to any one of claims 19 

to 28, wherein the sequence for recognition signals of 
20 particular restriction nucleases in regions which, after 
processing of the protein, separate existing domains, 
contains uniquely occurring cleavage sites for restriction 
endonucleases . 

25 30. A DNA sequence according to any one of claims 19 

to 29, wherein the sequence has at both its ends cleavage 
sites for endonucleases which do not appear in the 
remaining sequence and in a vector for utilization. 

3 0 31-. A host organism which contains the complete DNA 

sequence according to any one of claims 19 to 30. 

32. A host organism according to claim 31, wherein 

the organism is E. coli. 
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33. A host organism according to claim 32, wherein 

the E. coli strain is the repressor-producing E. coli 
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strain DH5alphaZl. 

34. A host organism according to claim 31, wherein 
the host organism is HeLa cells. 

5 

35. A host organism according to Claim 31, wherein 
the host organism is CHO cells. 

36. A host organism according to claim 31 or claim 
10 32, wherein the host cells synthesize constitutively tTA. 

37. A host organism according to claim 31, wherein 
the host organism is selected from the group consisting of 
Toxoplasma gondii, Leishmania, baculovirus, adenovirus and 

15 yeast. 

38. Use of a gpl90/MSPl protein produced by a process 
according to any one of claims 1 to 18 for active 
immunization against malaria. 

20 

39. Use of a gpl90/MSPl protein produced by a process 
according to any one of claims 1 to 18 for producing 
monoclonal antibodies suitable for passive immunization. 

25 40. Use of a DNA sequence produced by a process 

according to any one of claims 19 to 30 for producing a 
vaccine based on nucleic acids. 

41. A process for the stabilization of gpl90/MSPl 
3 0 gene sequence from Plasmodium, comprising the step of 

reducing the AT content of the sequence compared to the 
naturally-occurring sequence . 

42. A stabilized Plasmodium gpl90/MSPl gene having a 
3 5 reduced AT content compared to that of an unstabili^ed 

gene . 
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43 . A vector containing a DNA sequence according to 

any one of claims 19 to 30 and/or claim 42. 

44. A host cell containing a vector according to 

5 claim 43 . 



45. A vaccine containing a protein produced by a 

process according to any one of claims 1 to 18 and/or a DNA 
sequence according to any one of claims 19 to 3 0 and/or a 
10 host according to any one of claims 31 to 37 and/or a 
vector according to claim 43 . 



46. A vaccine according to claim 45, further 
comprising immunity-promoting products from Plasmodium. 

15 

47. A vaccine according to claim 46, wherein the 
Plasmodium is Plasmodium falciparum . 

48. A process according to claim 1 substantially as 
2 0 hereinbefore described with reference to any one of the 

examples . 

Dated this 5th day of October 2000 
HERMANN BUJARD 
2 5 By their Patent Attorneys 
GRIFFITH HACK 

Fellows Institute of Patent and 
Trade Mark Attorneys of Australia 
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from FCgT" ° f ^ (9Pl9 ° n) and of the s V nthetic g^e (g P 190s) for gpl 



AS MKIirrLCSFLFrilNTQCVXHESYQE 
5pl^° fl G A TT A T T A A A T A A T ACT A A 

gp 1 9 0 3 C SCA. ggcg7 ATGAAAATCA I , ri XuriC CTCTCTTCATTT Cl^riU 1 ~ lA TCATCAATACTCACTCCCTCACCCJ^SAATCCTATCAGGAC 



27 
90 



ASLVXKZ.EAI.EDAVLTCYSLFQICEXMVLWECT 5 7 

gpl90^ TCA A A AATGATTTTATA AA AT A A A 

gp 1 9 0 CTCCTTAAGAAACT<XIAACCTTTC<^G 1 8 0 

ASSGTAVTTS T ? G S K G S VASGG SGGSVASGG S 37 
gpl 9 0* AAT TT T AGTAT TCA T A CATTATCA. 

gp!9 0 AGTGCCACGGCCGTTACAACCACCACACCCCGTTCTAAAGGGTCTGT^ 270 

ASVASGGSVASGGSVAS GCSGN SRRTMPSDN S Ii7 
gpl90 n T T A T TCA T T T TTCA T AT TTCAA CTA TATTA 

gpl90 3 CTCCCCTCCGCCC<XLACCCTC<X^TCAGCTCGCTC^^ 



360 

ASSDSDAKSYADOKHRVRNYLLTIXELKYPOL 147 
qpl$Q T ATTAT TTTAAA AC TCTGTA AACATTACC 

gp 1 9 0 8 AGC^TTCCCACCCCAACTCCTACCCCCACCTCA^ 4 50 



AS FDLTN HMLTI#CDNIH G F X Y L IDCYEEINEL 177 

gpl90 n T TT A T A TT T T T A TAT AT T A 

gp 1 9 0 fl TTCGACCTCACTJ^TCATATC<rrCAC^ 54 0 

AS LYXLNFYFDLLRAXL NDVCANDYCQIPFN I* 207 

gp!90 n TATAACTTTTAT A A T A TATT T AT CT 

gpl 9 0 3 CTGTACAAGTTCJ^TTTCTACTTCGAC^ 630 

ASXIRANELDVLXXLVFGYRXPLDNIXDNVGX 237 

gpl90 n ATCTATAA CTAACTG AAA AT A TTAT A A 

gpl 9 0 3 AAGATCACACCCAACGAGTTGCACCrrATTG^ 720 



ASHEDYI XKNXXTISNI NELIEESXXTIDXN X 267 
g?150 n C A A A AATATATAT AGT G A A T T 

gp!90 3 ATGGAAGJ\TTAIATTAAAAAGJWCAAGAAG^ 810 



ASNATKE EE K KXLYQA Q YDI* S I YNXQLEEAS N 297 
gp!90 n T A A A A A A ATATT TT TCT AT A A T 

gp!90 3 AATCCAACCAAGGAGGAAGAAAAGAAGAAGTT^TACCACC^ 900 

ASLISVLEXRXDTLXKN EHIKELLDKINSIKN 327 
gpl90 n T A A TTAAATT TT AAA C T G TATT A A 

gp 1 9 0 3 CTCA3X1\GCCTACTC<IAGAAGCK^TAG^ 990 

ASPPPANSGNTPNTIiLD K N K K I EEHEXEIXE I 357 
gpl90 n CAG T ATAATTCTT AAC A A A A AT 

gpl90 3 CCTCCGCCACCCAACTCTGCCAACACCCCTAACACGCTCCTC^ 1080 

ASAKTIKFNI DSLFTDP LELEY YLREKHKHI 0 387 
gpl90 n T A T T T AG T A A AT A A T A AA A TT 

gp 1 9 0 3 GCCAAAACCATTAAGTTCAACATAGAXTCTCTCTTTACTGATC 1170 

AS I SAKVETKESTEPN2 YPHCVTYPLSYNDIN 417 
gpl90 n AAGT ACTA TC AATTTTA T 

gpL90 3 ATCTCCSCCAAAGTCGJ^CAAAGGAATCAACCGA^ 1260 

A3NALNELNSFGDLIM? FDYTKEPSXNIYTD N 447 
gpigO 11 T T A T A T TCT T T A TAT A AAG A C A T T T 

gp!90 S AACCCTCTCAACGJkGCTCAATACCTTCCCTC^^ 13 50 

ASERKXFINE IKEKIXI EKXKI ESDXXSYEDR 477 
gpl9Q ;i A AACAT T AAT A A A A ATC T A TC A A 

gp 1 9 0 3 GAGAGAuAAGAACTTTATCAACClAAATCAACO^ 14 4 0 

ASSXSLNDITXEYEXI.L NEIYDSXFNNNIDLT 507 
gp!90 n TCT CTC TT AA A AT ATTAT AG T TATTAT 

gp 1 9 0 3 AGCAAAAGTCTAAACCATATCACTAAAGAGTATGAAAACCTG 15 3 0 

ASHFEXMMCKRYSYXVE KLTHBNTFASYENS X 53 7 
5?190 n TA TAATATT T T A A 

gp 1 9 0 3 AACTTCCAGAAAATCATGGCAAAACCOT 1620 
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AS 

gpi90 n 
gpi9o 3 

AS 

gpl90 n 

gpL90 s 



AS 

gpi90 n 
gpl90 3 

AS 

gp!90 n 
gp!90 3 

AS 

gp!9C a 
gpl90 3 

AS 

g?190 n 
gpl90 3 

AS 

gp!90 n 
gpl90 3 

AS 

gpl90 n 
gpl90 3 

AS 

gpl90 n 
gpl90 3 

AS 

g?190 n 
gpl90 3 

AS 

gpl90 n 
•«TP190 3 

AS 

gpl90 n 
, g?190 3 

AS 

g?190 n 
gp!90 fl 

AS 

gp!90 n 
gp!90 3 

AS 

gpi90 J 
gpl90 ; 



H N X* E K T titet *•« 

A - A A / " 2 ° Y 3 L R N Z V V E K - ^ K Y y 

A A ~ TAATAATA 7 A ~ 

. - CTTAAGT ATATGGACCACTATTCTCTGC CGAACATTCTTCTCCACAAAGAACTAAACTACT 

, a":« IS c K * I ;" i E: , !?:,£N!J * ° e e 0 l r e k 
"""""•aagx*^^ 

T T KDE »*PBS K I L«»SDIWI t V0VQlcvLL» lHlt 

1 0 C* £ A* A* ' ° £ c 1 £ A * / ^ * » » 18 V * " S ' * 0 « H 

* V WAVoSoV ° * ' " V ' M ' " 1 " ' * " « 

" a" a" 1 A K T a" ° ,° A * C D " " ' S * 8 = E X T G Q A T ? K p 

S Q WaV T A~ V ^ A ^ A Q E Q X QAQPPVPVp 

V A ? " A A A K A A Q V c" J ' ? * * V * » * * * » V S K L 0 V L S K 
«««™«»^^ 

/ : . S A ? . I l C , 1 K Y 1 J; v s a 3 T » H E * I L X Q v 

CTCTAZ^CTTCCT^AIACATCCTA^C^CACAAATAIA^ 

" A" T T A" E <T » E »* / L S S = ° P L ° L L F » 1 Q ■ * I P V H » 
OAAC T AAGT A t » T AT ■« i i ~ , _ . 

':'•'•»' "lei*'*' 1 Y E K E M V c N L * K L * 

^ ^ A^ ^ A^ TT A** A^ " * A^ X S V K I»SSSSMQPI, 
«CAAC=ACAA^AA=AAC=^^^ 

S a^a T ? Q G 3 / A ? E " S A N ° ° " S 8 S - « I- » « S L X I. F E 

" X « ac'-VaVa"- 1 ** ° * L 1 6 0 * S S E " r v 5 K 1 «• * 

-XAgA -— r -„ - . .,.*....*. A C A " A TAATA AGTAGT A T T A - A 

AACATCCKTCTCTCCCC^AGAATAACA^TCTACCAACAACrrAT^^ 

\\V V VaIctVaV// • « A 0 O E 3 K , 

«CAC==A=ACA TOA rA^GAGACCTrCA^^ 




AS 

gp!90 n 
gpl90 3 

AS 

gpl90 3 

AS 

gpX90 n 
gpl90 8 

AS 

gp:90 n 
gpl90 fl 



i A F T° / A* * X T V , 6 - 8 A ? * / « 1 * *t T I- I * » O L E S X I, H 
<^«««AAG«CACAC^^ 

S VtVaW - A V V 7 ' ^ " * * * ' A = X * = X E H T 
^ACTGAACAATCCCAAACACGTACTGCACAAC' w T w ~*T'ar-^X*«v---f»^'» AAA TAAAATA A 

~w~-«™- . i » .tjotu. iCAACAACAAGAAGGAACCCGACATCGCCGAGACACAGAACACT 

a' A" ? A X A' A~ J C K ' - V / G L V K Y - K = S S S » * ~- S . E 

™««*=«CAA« 

1 AXCA 1 T ° A ? A S A ° " T Y / 5 ^ * » ' " " I » K L = C X L K D H t H 
CACAC^TCCACACC^TAACrAiccCA^CTCC^CTT^^ -* ^ 



567 
1710 
597 
1B00 
627 
1390 
657 
1980 
637 
2070 
717 
2160 
747 
2250 
777 
2340 
807 
2430 
837 
2S20 
967 
2610 
897 
2700 
927 
2790 
957 
2880 
987 
2970 
1017 
3060 
1047 
3150 
.1077 
3240 
1107 
3330 
1137 
3420 
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AS 

gpl90 n 
gp!90 a 

AS 

<jpl90 n 
gpl90 a 



T A A A AT ATCA TAAT T A TT ATT AT A A A A A * * 

CTC^ACA^CA^CCT^ 

C / ^ S T E / * * fj* K ^ A , E S r X K F t P E c T 0 V a T 
CKXAATAGCCCAA^GAGAATAATACAGACCT^ 




AS 

gp'SO 8 
AS 

gpl90 a 
gplSO 3 

AS 

gpl90 n 
gpl90 a 

AS 

gp!90 n 
gpl90 8 

AS 

gpl90 n 
g?130 a 

AS 

gpi90 n 
gpl90 a 

AS 

gp!90 n 
gpl90 fl 

AS 

gpl90 a 
AS 

gpl9 0 n 
. gp!90 a 

AS 

gpl90 n 
gpL90° 

AS 

gpl90 n 
qpL9Q* 

AS 

gpl90 n 
gpl90* 

AS 

gpi90 n 
<jpl90 3 

AS 

gpi9G n 
gpi90 3 



AA A AA A 

ACCACA:rcrrcACAJvcGTcc^ 



Q A V x V A T , C J A V T ? S V 1 ° N 1 - 3 K I E N E Y E V L Y L K 

AAAAAAAA a A""*" TAT 

CAGCraTCACCCCTCACOT^ 

TT AACTAA AT A A ? AT ^ T T T ~ A t4U» 

CCTCTCGCACGCGTtrTATACCTCT^^ 

R a" " " £ T E " ' X a\ V T L A" T L / V * ° A' A' S " " " " K 
C=*^^AAGACAGAAA^ 

V Va^V " E a X A R \° ^ ^ ^* ^ S Y NY ^KOSIDTDIN 

1 2 * AAA CT AAGC t»i> t ^AATG A 

GACCC^TACAAGTTCCTCAATAAACJUSA^^ 

F^A NDVLGYYKILSEKYKS DLDSIK- XYI NDK 

^ A TTA TAATATC T A A TT ATA A C A 

QC E NEKYLPFLMNIETLY KT VNDKIDL F V I 

* G CT TTACTTG TATA TTT T T m A T 

CAAC^CAGAAJTGAAAAATJV^^ 

ff J: a "/ KVLNYTYEKS NVEVKI KELNYLXTIO 
CAC^GAG*— A T A T AT ATCA CA AAATTTA T 

OIC s i AOrKKNNHrVCIADL STDYNHNNLLTK 
TA t T TT AAA *** T C* A*" A 

F C L -IV A C 7 / , E N L A ^ T V L S H L L D G N L Q G M L N I 

C .ACT AT . T TTT C TT ATCT TATTAT AT T A T 

CCTCC^GCCAACCTC<JU^^L\TCCTC^ 

S OHQCVKKQCPOMSGCFR HLDSREfiCXCLl 
* A A ATAATCTA A TATAAA ATA T AT A 

N T Y!CQEG T DK A C J',, SN PNPTCM EN NGGCDAOAKC 
AJUrrAJ^AACJ^CAACCAG^ 

r-S EDSGSNGXKITCECTlCPDSYPLFDGIFC 
A TTCA TAGC ^ A A ^ <T . T T . 

ACCGACX^VGACACCC^rTCTA^ 



1167 
3510 
1197 
3600 
1227 
3690 
1257 
3780 

12 8 7 
3870 
1317 
3960 
1347 
4050 

13 77 
4140 
1407 
4230 
1437 
4320 
1467 
4410 
1497 

4500 

1527 

4590 

1557 

4680 

1537 

4770 

1617 

4860 



1639 



SSSNFLGIFTL1.ILMLILVS 

iS^.^^.f L-—- _^ CA T AT A A T A AT A * T* 

^ ; ^ --uCTCATCCT^T<^TGATCCTCT ACAGCTTCATCT AATAf; ATf p flr rsr. 4 940 

stop codon cia 1 
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